Fast speech recognition for voice destination entry in a car navigation system
Abstract
In this paper, we introduce a multi-stage decoding algorithm optimized to recognize a very large number of entry names on a resource-limited embedded device. The algorithm is composed of a two-stage HMM-based coarse search and a detailed search. The two-stage HMM-based coarse search generates a small set of candidates that is assumed to contain the correct hypothesis with high probability, and the detailed search re-ranks the candidates by rescoring them with sophisticated acoustic models. We conduct experiments with one million point-of-interest (POI) names on an in-car navigation device with a fixed-point processor running at 620 MHz. The experimental results show that the multi-stage decoding algorithm runs at about 2.23 times real time on the device without serious degradation of recognition performance.
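The coarse-to-fine flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three scoring functions are plain string-similarity stand-ins for the HMM-based coarse scores and the detailed acoustic-model score, and the function names and the `keep1`/`keep2` cutoffs are hypothetical.

```python
import heapq
from difflib import SequenceMatcher


def cheap_score(query: str, name: str) -> float:
    """Coarse stage 1 stand-in: share of the query's distinct characters found in the name."""
    q = set(query)
    return len(q & set(name)) / len(q) if q else 0.0


def medium_score(query: str, name: str) -> float:
    """Coarse stage 2 stand-in: Jaccard overlap of character bigrams."""
    def bigrams(s: str) -> set:
        return {s[i:i + 2] for i in range(len(s) - 1)}

    a, b = bigrams(query), bigrams(name)
    return len(a & b) / len(a | b) if a | b else 0.0


def detailed_score(query: str, name: str) -> float:
    """Detailed stage stand-in: a costlier sequence-similarity ratio."""
    return SequenceMatcher(None, query, name).ratio()


def multi_stage_decode(query, names, keep1=100, keep2=10):
    """Prune the full list in two cheap passes, then re-rank the survivors."""
    # Coarse stage 1: cheapest score over every entry, keep the top keep1.
    stage1 = heapq.nlargest(keep1, names, key=lambda n: cheap_score(query, n))
    # Coarse stage 2: somewhat costlier score over the survivors, keep the top keep2.
    stage2 = heapq.nlargest(keep2, stage1, key=lambda n: medium_score(query, n))
    # Detailed search: rescore only the short candidate list and re-rank it.
    return sorted(stage2, key=lambda n: detailed_score(query, n), reverse=True)
```

The point of the design is that the most expensive score is computed only for the `keep2` candidates that survive both coarse passes, which is what makes a very large entry list tractable on a constrained device.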
Related resources
In-vehicle destination entry by voice: practical aspects
Speech recognition has been shown to increase driver safety in car applications, if the application is well designed. Destination entry by voice in particular is not only safer, but also faster and more convenient than the traditional haptic interfaces [1][2]. Building a high-performing in-car destination entry system requires tackling a number of practical challenges. The vocabulary is very large...
Techniques for robust speech recognition in the car environment
The use of voice commands or navigation features in the car is becoming a necessity. As keyboard and display interfaces cannot be used safely while driving, much effort has been made to make automatic speech recognition (ASR) and text-to-speech synthesis (TTS) ubiquitous features in the car. From voice dialing to car navigation, the requirements for voice technology vary greatly. While the use ...
Speech Finds its Way in Navigation Systems
In this white paper, Nuance examines how speech is redefining the way people interface with personal navigation devices (PNDs). As PNDs grow in popularity and gain recognition as essential tools for today's mobile consumer market, speech can add considerable value in terms of both safety and ease of use. In this paper, we explore how speech can be used to simplify information input, provide aud...
Context-Based Face Control of a Robotic Wheelchair
In this article, a method to perform semiautonomous navigation on a wheelchair is presented. Contextual information from the environment, such as the user's habits and points of interest, is employed to infer the user's desired destination in a global map. Illogical steering signals coming from the user-machine interface input are filtered out to improve the overall performance of the system. Examples us...
New Developments in Speech Recognition for Car Navigation System
1. Introduction Mitsubishi Electric is striving to improve speech recognition capabilities for the car navigation system. One of the key problems arises when the user speaks out-of-vocabulary words, which cannot be recognized. To solve this problem, the speech recognition system was equipped with a smart POI (point of interest) search function (1), which automatically generates variatio...
Publication date: 2009